Repost One word reply from customer not recognized and speed of voice Vapi AI #support

Join Discord

Repost One word reply from customer not recognized...

# support

stefan

09/09/2024, 2:46 AM

Please have a look at my original post, your chat bot locked and marked it as Solved. But its not.

Vapi

09/09/2024, 2:47 AM

@stefan I'm sorry to hear that your issue wasn't resolved. Could you please provide me with more details about your original post and the problem you're experiencing? I'm here to help you get it sorted out.

Vapi

09/09/2024, 2:47 AM

You can continue this conversation by mentioning me in the message.

stefan

09/09/2024, 2:49 AM

Can a human look in to this? Your chat bot already tried and failed.

Shubham Bajaj

09/09/2024, 5:28 PM

can you edit this ticket with your issue and tag me here again.

stefan

09/10/2024, 2:44 AM

@Shubham Bajaj When the customer gives a one word answer, its not recognized, except for the fist time. Swedish language only! AI will ask (in Swedish) Is this Matias I'm speaking to? If customer replies: "Yes" nothing will happen and the call will eventually time out. As in this example: Call ID: 95997d6d-3ce5-467b-bc33-0b28a12a7bfd If the customer however replies: "Yes it is" It works as it should. However it works in Swedish if the one worded reply is after the "First Message" in the prompt. From the log: Assistant Hej, jag heter Joakim och ringer från Bra hjälp, vi som hjälper utsatta barn. User Hej ** ** one word reply recognized * Assistant Är det Mattias jag pratar med? * I replied Yes, but not recognized. * Can you also let me know what to write in the "path id string" and where to get it. See picture. Thanks Stefan https://cdn.discordapp.com/attachments/1282532613659295814/1282894457217355879/vapi3.jpg?ex=66e10411&is=66dfb291&hm=496c3ee0c95b2126b37653b859a2541daacb1d036667358d31e3257595b92591&

stefan

09/10/2024, 4:49 PM

@Shubham Bajaj any update?

Leobaldo Alcantara

09/10/2024, 8:32 PM

Hi, I'm an Vapi user here. I detected the same problem with my calls. I'm openning an new thread about it. About your question: The id can be found in the assistant page, top of page, below the assistant name.

Shubham Bajaj

09/10/2024, 9:31 PM

hey @stefan can you check call id again there was no response from user after assistant voiced out

Is this Mattias I’m speaking with?

stefan

09/11/2024, 4:12 AM

@Shubham Bajaj @Leobaldo Alcantara As I stated in my post, there was as response. I replied Yes, but not recognized/transcribed. This is not just a one time fail. I have tried MANY times, it does not work. Seems like its not just for me as Leobaldo is reporting the same issue. Thanks Leobaldo for your input!

Shubham Bajaj

09/11/2024, 7:36 PM

cam you share call ids where it's captured your input(as transcript) even in call recording i can help.

stefan

09/12/2024, 4:20 AM

@Shubham Bajaj Im sorry but are you reading my posts at all? As I wrote in my first message on September 8, It works after the "first message" I also provided a copy of the logs. Here it is again. From the log: Assistant Hej, jag heter Joakim och ringer från Bra hjälp, vi som hjälper utsatta barn. User Hej * * *********** **one word reply recognized ************** Assistant Är det Mattias jag pratar med? ** I replied Yes, but not recognized. ******** Same call ID as provided previously: 95997d6d-3ce5-467b-bc33-0b28a12a7bfd

Shubham Bajaj

09/12/2024, 8:14 PM

you right after this 🔵 10:20:24:994

assistant

Final Transcript : Är det Mattias jag pratar med?: 0.9577637 your words were not captured, can you try w/o bg noise because your words post capturing were kinda removed/filtered.

Shubham Bajaj

09/12/2024, 8:14 PM

@stefan try again w/o being in closed environment.

Shubham Bajaj

09/12/2024, 8:14 PM

let me know how it goes.

stefan

09/13/2024, 12:00 PM

@Shubham Bajaj I tried many times before, it was not a one time fail. But have since modified the assistant and for the moment it is working. I do have three other issues. 1. I tried to use Its not doing anything. I have enabled ssml parsing via "Update assistant" Call ID: ee3ac508-a34e-451f-a551-e4f62b8a1412 2. I tried to use emotions, but the same issue. It doesn't matter what i write, no change in the voice. https://elevenlabs.io/docs/speech-synthesis/prompting 3. Im using a Swedish voice, (have tried with several, even made my own clone at elevenlabs.) In every single call the voice changes dialect after a while, it sounds like its a foreigner trying to speak Swedish. Some words are to the point it cant be recognized. Thanks!

Shubham Bajaj

09/13/2024, 2:33 PM

Regarding your mentioned issues: [1] we have sent the same to 11labs what you have given to us 🔵 11:50:55:930 Voice Input Formatted: "vi som hjälper utsatta barn. Anledningen till att jag ringer är att snart är sommaren slut.", Original: "vi som hjälper utsatta barn.. Anledningen till att jag ringer är att snart är sommaren slut." 🔵 11:50:55:930 ElevenLabs (Websocket #0) Pushing 110... "vi som hjälper utsatta barn. Anledningen till att jag ringer är att snart är sommaren slut." 🔵 11:50:55:930 [user LOG] Voice input: vi som hjälper utsatta barn. Anledningen till att jag ringer är att snart är sommaren slut. [2] regarding emotionRecognitionEnabled i will say it's not perfect as currently it uses the transcription and llm model output so chances you may not get consistent results [3] when we stream voices during calls, the difference is noticable because of streaming and meeting latency SLA. what I can suggest is to use default values for better consistency. "model": "eleven_turbo_v2", "style": 0.9, //upto your requirement and expectations. "voiceId": "wIdBMZsynaZq6gk0sjXJ", "provider": "11labs", "stability": 0.5, "similarityBoost": 0.75, "useSpeakerBoost": false, "enableSsmlParsing": true, "fillerInjectionEnabled": false, "optimizeStreamingLatency": 2

Shubham Bajaj

09/13/2024, 2:33 PM

@stefan anything else i can help with?

stefan

09/16/2024, 9:05 AM

@Shubham Bajaj I've been doing additional testing and am experiencing significant delays in response times. When I test your assistant on www.vapi.ai I consistently get excellent response times—between 500 ms and 1000 ms at most. However, the two calls below are showing such delays that it's becoming unusable. Please look into this issue as soon as possible. 1500 to 4000 ms is just not cutting it. This call for example: 82770825-0962-473f-adb2-de1ac688feaf Customer says Hello 4 seconds of silence! Customer says Hello again 1.5 seconds of silence Bot finally starts to speak….. Another example: 25c5cebf-5c73-4b60-989d-2420e77ed6d9 Customer says his name. 1.5 Seconds silence Bot: asking for a specific person. Customer: you have reached the wrong person 4.5 seconds of silence Bot finally starts to speak….. I made a new assistant (your preconfigured AVA), standard settings except for gpt4o. Its a bit better, but still not as good as the respons times at www.vapi.ai b304e05c-b8ab-4e24-8652-d3f2dcf8640e Been testing some more, the delay seems get a lot worse when using non English languages. I would really appreciate a solution to this, even if it means having to upgrade to Enterprise/custom plan etc.

Shubham Bajaj

09/16/2024, 4:18 PM

for the second call id

25c5cebf-5c73-4b60-989d-2420e77ed6d9

the user input was > Nej, då har det ringt fel. Har kommit till Anders. Lindblad. If you notice the user has spoken in 3 utterances because of this you observed a delay in response.

Shubham Bajaj

09/16/2024, 4:22 PM

for the first call, yes first user message was missed exactly i couldn't from whom side it is but i suggest adding idle messages even in case it's missed from system side still a message will be sent out to the user.

Shubham Bajaj

09/16/2024, 4:23 PM

@stefan if you have recent example of calls of type first call id then do share will create an issue for it.

stefan

09/18/2024, 10:06 AM

@Shubham Bajaj Hello again, I just made a new test call. 65f9186d-00dd-436c-81eb-5ebd04f89904 It took the bot 4 seconds to reply after I answered with my name. After said "its me" I waited 8 seconds, no reply from the bot. After I said "hello its me" it took 2.5 seconds before the bot replied. Sorry, but not really usable in the real world. Is there anything I can do to get it down to sub 1 second as it is on your own test bot? Thanks!

Shubham Bajaj

09/18/2024, 1:22 PM

You said 00-02 secs: Jag Bot replied started from 06 secs (latency of 03 secs): Hej Jag heter David, Jag söker Michael 🔵 09:52:16:600

user

Final Transcript : Jag: 0.7084961 🔵 09:52:17:718 Getting Speech For

11labs:wIdBMZsynaZq6gk0sjXJ:1:0:0.9:false:4:eleven_turbo_v2_5:16000:be1c2ae015703d72ac7e897280c69e20eae48cf12acb8666614c52b49cc4e794

, "39" Text: "Hej Jag heter David, Jag söker Michael" As visible from logs the llm generated response in require time, voicing out takes time.

Shubham Bajaj

09/18/2024, 1:23 PM

Now coming to sub second latency can not be committed as of now.

Shubham Bajaj

09/18/2024, 1:27 PM

more

🔵 09:52:29:681 Idle Timeout Triggered But No Idle Message. 🔵 09:52:30:090

user

Partial Transcript : Hallå: 0.7763672 🔵 09:52:31:039

user

Partial Transcript : Hallå, det var jag.: 0.80249023 🔵 09:52:31:041 [user CHECKPOINT] Model request started 🔵 09:52:31:300

user

Final Transcript : Hallå, det var jag.: 0.9016113 🔵 09:52:31:305 [user LOG] Model request started (attempt #1, gpt-4o-mini-2024-07-18, azure-openai, eastus) 🔵 09:52:31:874 ElevenLabs (Websocket #1) Pushing 69... "Hej Michael, som jag sa så heter jag David och ringer från Bra Hjälp," 🔵 09:52:32:019 ElevenLabs (Websocket #1) Pushing 91... "vi som hjälper utsatta barn. Anledningen till att jag ringer är att snart är sommaren slut." 🔵 09:52:32:154 ElevenLabs (Websocket #1) Pushing 55... "Och då är det dags för nya insatser för dom här barnen." ... 🔵 09:52:32:770 ElevenLabs (WebSocket #1) First Audio Message Received. Took 895ms.

Shubham Bajaj

09/18/2024, 1:28 PM

As it's clear forst voice input it took 895ms and similary for others.

2 Views

Previous Next